Search CORE

4,004 research outputs found

Exploratory Control with Tsallis Entropy for Latent Factor Models

Author: Donnelly Ryan
Jaimungal Sebastian
Publication venue
Publication date: 01/03/2024
Field of study

We study optimal control in models with latent factors where the agent controls the distribution over actions, rather than actions themselves, in both discrete and continuous time. To encourage exploration of the state space, we reward exploration with Tsallis entropy and derive the optimal distribution over states—which we prove is q-Gaussian distributed with location characterized through the solution of an BSΔE and BSDE in discrete and continuous time, respectively. We discuss the relation between the solutions of the optimal exploration problems and the standard dynamic optimal control solution. Finally, we develop the optimal policy in a model-agnostic setting along the lines of soft Q-learning. The approach may be applied in, e.g., developing more robust statistical arbitrage trading strategies

King's Research Portal

The CoNLL 2007 shared task on dependency parsing

Author: Hall Johan
Kübler Sandra
McDonald Ryan
Nilsson Jens
Nivre Joakim
Riedel Sebastian
Yuret Deniz
Publication venue
Publication date: 01/01/2007
Field of study

The Conference on Computational Natural Language Learning features a shared task, in which participants train and test their learning systems on the same data sets. In 2007, as in 2006, the shared task has been devoted to dependency parsing, this year with both a multilingual track and a domain adaptation track. In this paper, we define the tasks of the different tracks and describe how the data sets were created from existing treebanks for ten languages. In addition, we characterize the different approaches of the participating systems, report the test results, and provide a first analysis of these results

UCL Discovery

Hochschulschriftenserver - Universität Frankfurt am Main

Hedging Non-Tradable Risks with Transaction Costs and Price Impact

Author: Cartea Alvaro
Donnelly Ryan
Jaimungal Sebastian
Publication venue
Publication date: 01/01/2020
Field of study

A risk-averse agent hedges her exposure to a non-tradable risk factor

U

using a correlated traded asset

S

and accounts for the impact of her trades on both factors. The effect of the agent's trades on

U

is referred to as cross-impact. By solving the agent's stochastic control problem, we obtain a closed-form expression for the optimal strategy when the agent holds a linear position in

U

. When the exposure to the non-tradable risk factor

\psi(U_T)

is non-linear, we provide an approximation to the optimal strategy in closed-form, and prove that the value function is correctly approximated by this strategy when cross-impact and risk-aversion are small. We further prove that when

\psi(U_T)

is non-linear, the approximate optimal strategy can be written in terms of the optimal strategy for a linear exposure with the size of the position changing dynamically according to the exposure's "Delta" under a particular probability measure.Comment: Originally posted to SSRN April 27, 2018. Forthcoming in Mathematical Financ

arXiv.org e-Print Archive

Oxford University Research Archive

Path-Specific Objectives for Safer Agent Incentives

Author: Carey Ryan
Everitt Tom
Farquhar Sebastian
Publication venue
Publication date: 21/04/2022
Field of study

We present a general framework for training safe agents whose naive incentives are unsafe. As an example, manipulative or deceptive behaviour can improve rewards but should be avoided. Most approaches fail here: agents maximize expected return by any means necessary. We formally describe settings with 'delicate' parts of the state which should not be used as a means to an end. We then train agents to maximize the causal effect of actions on the expected return which is not mediated by the delicate parts of state, using Causal Influence Diagram analysis. The resulting agents have no incentive to control the delicate state. We further show how our framework unifies and generalizes existing proposals.Comment: Presented at AAAI 202

arXiv.org e-Print Archive

Association for the Advancement of Artificial Intelligence: AAAI Publications

Health surveillance during covid-19 pandemic : How to safeguard autonomy and why it matters

Author: Calvo Rafael
Deterding Christoph Sebastian
Ryan Richard
Publication venue: 'BMJ'
Publication date: 02/04/2020
Field of study

Crossref

Spiral - Imperial College Digital Repository

White Rose Research Online

Sustainable Rooftop Technologies: A Structural Analysis of Buildings at WPI

Author: Miranda Sebastian
Stokes Ryan
Taylor Ian Alexander
Publication venue: Digital WPI
Publication date: 26/04/2018
Field of study

This project evaluated the feasibility of the installation of sustainable rooftop technologies on selected buildings at Worcester Polytechnic Institute (WPI). This report includes the structural analysis and design of three sustainable rooftop technologies: solar panels, green roofs, and solar collectors. These technologies have the ability to save energy, while contributing to WPIs sustainability plan. Additionally, an economic analysis is prepared to show the simple payback periods of installing these sustainable rooftop technologies

DigitalCommons@WPI

Decision heuristic or preference? Attribute non-attendance in discrete choice problems

Author: Heidenreich Sebastian
Phimister Euan
Ryan Mandy
Watson Verity
Publication venue: 'Wiley'
Publication date: 16/06/2018
Field of study

Funded by National Clinical Assessment Service (NCAS) and Institute of Applied Health SciencePeer reviewedPostprin

Aberdeen University Research

Lower your guards: a compositional pattern-match coverage checker

Author: Graf Sebastian
Peyton Jones Simon
Scott Ryan G.
Publication venue: Association for Computing Machinery
Publication date: 11/08/2020
Field of study

KITopen

Morphologically Aware Word-Level Translation

Author: Copestake Ann
Cotterell Ryan
Czarnowska Paula
Ruder Sebastian
Publication venue
Publication date: 01/01/2020
Field of study

We propose a novel morphologically aware probability model for bilingual lexicon induction, which jointly models lexeme translation and inflectional morphology in a structured way. Our model exploits the basic linguistic intuition that the lexeme is the key lexical unit of meaning, while inflectional morphology provides additional syntactic information. This approach leads to substantial performance improvements - 19% average improvement in accuracy across 6 language pairs over the state of the art in the supervised setting and 16% in the weakly supervised setting. As another contribution, we highlight issues associated with modern BLI that stem from ignoring inflectional morphology, and propose three suggestions for improving the task.Comment: COLING 202

arXiv.org e-Print Archive

Repository for Publications and Research Data

Crossref